Summary
We are looking for a Linguist to help us develop language components for a variety of voice-enabled technologies and products. We are seeking candidates with native or near-native fluency in French, Spanish and/or Italian with strong linguistic data analysis and language technology experience to manage data collection, data synthesis and data annotation tasks, translation-localization and ML model improvements.
Job Responsibilities
• Provide linguistic expertise in the areas of syntax, semantics, pragmatics and sociolinguistics
• Collaborate with other linguists and data operations teams in data collection, data curation, translation, localization and annotation efforts
• Create annotation systems and guidelines
• Evaluate and curate data sets for ML models
• Assess model and data quality
• Collaboratively develop complex and consistent linguistic analyses
Required Qualifications
• Master’s degree in general Linguistics or Linguistics with an emphasis on Romance languages, Computational Linguistics, Speech Science, or related field
• Native or near-native fluency in French, Spanish and/or Italian
• Awareness of French, Spanish and/or Italian linguistic, cultural, local norms
• Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and other areas of linguistics
• Experience working with speech and text data in multiple languages
• Familiar with Large Language Models (LLMs) and their applications
• Comfortable working in a fast paced, highly collaborative, dynamic work environment
• Strong organizational skills and detail oriented
• Excellent communication skills both verbal and written
Preferred (additional) Qualifications
• PhD in Linguistics or Romance languages, language technologies, computational linguistics, speech science, or related field
• Experience with database queries and data analysis processes (i.e. SQL, spreadsheets, R, Unix, or others)
• Proficiency in Python
• Experience with machine learning frameworks, NLP Libraries and Tools